Chapter 16 ASSOCIATION RULES

Author

  • Frank Hoppner
Abstract

Association rules are rules of the kind "70% of the customers who buy wine and cheese also buy grapes". While the traditional field of application is market basket analysis, association rule mining has since been applied to various other fields, which has led to a number of important modifications and extensions. We discuss the most frequently applied approach that is central to many extensions, the Apriori algorithm, and briefly review some applications to other data types, well-known problems of rule evaluation via support and confidence, and extensions of or alternatives to the standard framework.

1 Introduction

To increase sales at retail, a manager may want to offer a discount on certain products when they are bought in combination. Given the thousands of products in the store, how should these products be selected (in order to maximize the profit)? Another possibility is to simply place products that are often purchased in combination close to each other, to remind a customer who just rushed into the store to buy product A that she or he may also need product B. This may prevent the customer from visiting a (possibly different) store to buy B a short time later.

The idea of "market basket analysis", the prototypical application of association rule mining, is to find such related products by analysing the content of the customer's market basket, yielding product associations like "70% of the customers who buy wine and cheese also buy grapes." The task is to find associated products within the set of offered products, in this case as support for marketing decisions. Thus, for the traditional form of association rule mining, the database schema S = {A_1, ..., A_n} consists of a large number of attributes (n is in the range of several hundred) and the attribute domains are binary, that is, dom(A_i) =
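To make the "70% of the customers who buy wine and cheese also buy grapes" reading concrete, the following minimal Python sketch computes the support and confidence of such a rule on a tiny basket database and shows one way to encode a basket as a row of binary attributes. The transactions and the helper names (`support`, `confidence`, `to_binary_row`) are illustrative assumptions, not taken from the chapter, and the definitions used are the usual textbook ones.

```python
# Hypothetical toy baskets (invented for illustration, not from the chapter).
transactions = [
    {"wine", "cheese", "grapes"},
    {"wine", "cheese", "grapes"},
    {"wine", "cheese"},
    {"wine", "bread"},
    {"cheese", "grapes"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item of `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Among baskets containing `antecedent`, the fraction that also contain `consequent`."""
    base = support(antecedent, transactions)
    if base == 0:
        return 0.0  # rule never applies in this database
    joint = support(set(antecedent) | set(consequent), transactions)
    return joint / base


def to_binary_row(basket, attributes):
    """Encode a basket as 0/1 values, one per attribute (product) in the schema."""
    return [1 if a in basket else 0 for a in attributes]


# Schema: one binary attribute per product, in a fixed (sorted) order.
attributes = sorted(set().union(*transactions))

if __name__ == "__main__":
    print("attributes:", attributes)
    print("row for basket 4:", to_binary_row(transactions[3], attributes))
    print("support(wine, cheese):", support({"wine", "cheese"}, transactions))
    print("confidence(wine, cheese -> grapes):",
          confidence({"wine", "cheese"}, {"grapes"}, transactions))
```

In this toy database three of the five baskets contain both wine and cheese, and two of those three also contain grapes, so the rule holds for about two thirds of the customers it applies to. Each basket is simply a row of 0/1 values over the product attributes, which is the binary view of the data that the paragraph above describes.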





Publication date: 2006